Improved Generative Adversarial Network Method for Flight Crew Dialog Speech Enhancement
نویسندگان
چکیده
Traditional speech enhancement algorithms are only suitable for dealing with stationary noise, but the noise in stage of flight is nonstationary so traditional method not flight. This paper proposes a algorithm based on generative adversarial network: Deep Convolutional–Wasserstein Generative Adversarial Network (DWGAN). Firstly, model integrates deep convolutional network and Wasserstein distance network. Secondly, it introduces conditional to improve enhanced quality, spectral constraint layer used prevent from falling too fast causing collapse. Finally, L1 loss term introduced into function reduce number training times further quality. The experimental results show that intrusiveness background overall processed quality DWGAN improved by about 7.6 9.4%, respectively, compared WGAN acoustic environment simulated aircraft operation.
منابع مشابه
SEGAN: Speech Enhancement Generative Adversarial Network
Current speech enhancement techniques operate on the spectral domain and/or exploit some higher-level feature. The majority of them tackle a limited number of noise conditions and rely on first-order statistics. To circumvent these issues, deep networks are being increasingly used, thanks to their ability to learn complex functions from large example sets. In this work, we propose the use of ge...
متن کاملLanguage and Noise Transfer in Speech Enhancement Generative Adversarial Network
Speech enhancement deep learning systems usually require large amounts of training data to operate in broad conditions or real applications. This makes the adaptability of those systems into new, low resource environments an important topic. In this work, we present the results of adapting a speech enhancement generative adversarial network by finetuning the generator with small amounts of data...
متن کاملWasserstein Generative Adversarial Network
Recent advances in deep generative models give us new perspective on modeling highdimensional, nonlinear data distributions. Especially the GAN training can successfully produce sharp, realistic images. However, GAN sidesteps the use of traditional maximum likelihood learning and instead adopts an two-player game approach. This new training behaves very differently compared to ML learning. Ther...
متن کاملControllable Generative Adversarial Network
Although it is recently introduced, in last few years, generative adversarial network (GAN) has been shown many promising results to generate realistic samples. However, it is hardly able to control generated samples since input variables for a generator are from a random distribution. Some attempts have been made to control generated samples from GAN, but they have shown moderate results. Furt...
متن کاملExploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition
We investigate the effectiveness of generative adversarial networks (GANs) for speech enhancement, in the context of improving noise robustness of automatic speech recognition (ASR) systems. Prior work [1] demonstrates that GANs can effectively suppress additive noise in raw waveform speech signals, improving perceptual quality metrics; however this technique was not justified in the context of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of aerospace information systems
سال: 2023
ISSN: ['1940-3151', '2327-3097']
DOI: https://doi.org/10.2514/1.i011168